Your browser doesn't support javascript.
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
1.
Bioinformatics Research and Applications, Isbra 2022 ; 13760:369-380, 2022.
Artículo en Inglés | Web of Science | ID: covidwho-2309148

RESUMEN

Clustering viral sequences allows us to characterize the composition and structure of intrahost and interhost viral populations, which play a crucial role in disease progression and epidemic spread. In this paper we propose and validate a new entropy based method for clustering aligned viral sequences considered as categorical data. The method finds a homogeneous clustering by minimizing information entropy rather than distance between sequences in the same cluster. We have applied our entropy based clustering method to SARS-CoV-2 viral sequencing data. We report the information content extracted from the sequences by entropy based clustering. Our method converges to similar minimum-entropy clusterings across different runs and limited permutations of data. We also show that a parallelized version of our tool is scalable to very large SARS-CoV-2 datasets.

2.
Diseases ; 11(2)2023 Mar 31.
Artículo en Inglés | MEDLINE | ID: covidwho-2299108

RESUMEN

During the COVID-19 pandemic caused by SARS-CoV-2, new waves have been associated with new variants and have the potential to escape vaccinations. Therefore, it is useful to conduct retrospective genomic surveillance research. Herein, we present a detailed analysis of 88 SARS-CoV-2 genomes belonging to samples taken from COVID-19 patients from October 2020 to April 2021 at the "Reina Sofía" Hospital (Murcia, Spain) focused to variant appeared later. The results at the mentioned stage show the turning point since the 20E (EU1) variant was still prevalent (71.6%), but Alpha was bursting to 14.8%. Concern mutations have been found in 5 genomes classified as 20E (EU1), which were not characteristic of this still little evolved variant. Most of those mutations are found in the spike protein, namely Δ69-70, E484K, Q675H and P681H. However, a relevant deletion in ORF1a at positions 3675-3677 was also identified. These mutations have been reported in many later SARS-CoV-2 lineages, including Omicron. Taken together, our data suggest that preferential emergence mutations could already be present in the early converging evolution. Aside from this, the molecular information has been contrasted with clinical data. Statistical analyses suggest that the correlation between age and severity criteria is significantly higher in the viral samples with more accumulated changes.

3.
18th International Symposium on Bioinformatics Research and Applications, ISBRA 2022 ; 13760 LNBI:369-380, 2022.
Artículo en Inglés | Scopus | ID: covidwho-2265112

RESUMEN

Clustering viral sequences allows us to characterize the composition and structure of intrahost and interhost viral populations, which play a crucial role in disease progression and epidemic spread. In this paper we propose and validate a new entropy based method for clustering aligned viral sequences considered as categorical data. The method finds a homogeneous clustering by minimizing information entropy rather than distance between sequences in the same cluster. We have applied our entropy based clustering method to SARS-CoV-2 viral sequencing data. We report the information content extracted from the sequences by entropy based clustering. Our method converges to similar minimum-entropy clusterings across different runs and limited permutations of data. We also show that a parallelized version of our tool is scalable to very large SARS-CoV-2 datasets. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

4.
J Comput Biol ; 29(5): 453-464, 2022 05.
Artículo en Inglés | MEDLINE | ID: covidwho-1758600

RESUMEN

In this work, we investigate using Fourier coefficients (FCs) for capturing useful information about viral sequences in a computationally efficient and compact manner. Specifically, we extract geographic submission location from SARS-CoV-2 sequence headers submitted to the GISAID Initiative, calculate corresponding FCs, and use the FCs to classify these sequences according to geographic location. We show that the FCs serve as useful numerical summaries for sequences that allow manipulation, identification, and differentiation via classical mathematical and statistical methods that are not readily applicable for character strings. Further, we argue that subsets of the FCs may be usable for the same purposes, which results in a reduction in storage requirements. We conclude by offering extensions of the research and potential future directions for subsequent analyses, such as the use of other series transforms for discreetly indexed signals such as genomes.


Asunto(s)
COVID-19 , SARS-CoV-2 , Benchmarking , Genoma Viral , Humanos , Filogenia , SARS-CoV-2/genética
5.
Infect Genet Evol ; 88: 104708, 2021 03.
Artículo en Inglés | MEDLINE | ID: covidwho-1039486

RESUMEN

The pandemic due to novel coronavirus, SARS-CoV-2 is a serious global concern now. More than thousand new COVID-19 infections are getting reported daily for this virus across the globe. Thus, the medical research communities are trying to find the remedy to restrict the spreading of this virus, while the vaccine development work is still under research in parallel. In such critical situation, not only the medical research community, but also the scientists in different fields like microbiology, pharmacy, bioinformatics and data science are also sharing effort to accelerate the process of vaccine development, virus prediction, forecasting the transmissible probability and reproduction cases of virus for social awareness. With the similar context, in this article, we have studied sequence variability of the virus primarily focusing on three aspects: (a) sequence variability among SARS-CoV-1, MERS-CoV and SARS-CoV-2 in human host, which are in the same coronavirus family, (b) sequence variability of SARS-CoV-2 in human host for 54 different countries and (c) sequence variability between coronavirus family and country specific SARS-CoV-2 sequences in human host. For this purpose, as a case study, we have performed topological analysis of 2391 global genomic sequences of SARS-CoV-2 in association with SARS-CoV-1 and MERS-CoV using an integrated semi-alignment based computational technique. The results of the semi-alignment based technique are experimentally and statistically found similar to alignment based technique and computationally faster. Moreover, the outcome of this analysis can help to identify the nations with homogeneous SARS-CoV-2 sequences, so that same vaccine can be applied to their heterogeneous human population.


Asunto(s)
COVID-19/epidemiología , Infecciones por Coronavirus/epidemiología , Variación Genética , Genoma Viral , Pandemias , SARS-CoV-2/genética , Síndrome Respiratorio Agudo Grave/epidemiología , África/epidemiología , Américas/epidemiología , Asia/epidemiología , Australia/epidemiología , Secuencia de Bases , COVID-19/transmisión , COVID-19/virología , Biología Computacional/métodos , Infecciones por Coronavirus/transmisión , Infecciones por Coronavirus/virología , Europa (Continente)/epidemiología , Interacciones Huésped-Patógeno/genética , Humanos , Coronavirus del Síndrome Respiratorio de Oriente Medio/genética , Coronavirus del Síndrome Respiratorio de Oriente Medio/patogenicidad , Coronavirus Relacionado al Síndrome Respiratorio Agudo Severo/genética , Coronavirus Relacionado al Síndrome Respiratorio Agudo Severo/patogenicidad , SARS-CoV-2/patogenicidad , Alineación de Secuencia , Síndrome Respiratorio Agudo Grave/transmisión , Síndrome Respiratorio Agudo Grave/virología
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA